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ABSTRACT 

We present a group-galaxy cross-correlation analysis using a group catalog produced from the 16,500 
spectra from the optical zCOSMOS galaxy survey. Our aim is to perform a consistency test in the 
rcdshift range 0.2 < z < 0.8 between the clustering strength of the groups and mass estimates that 
are based on the richness of the groups. We measure the linear bias of the groups by means of a 
group-galaxy cross-correlation analysis and convert it into mass using the bias-mass relation for a 
given cosmology, checking the systematic errors using realistic group and galaxy mock catalogs. The 
measured bias for the zCOSMOS groups increases with group richness as expected by the theory of 
cosmic structure formation and yields masses that are reasonably consistent with the masses estimated 
from the richness directly, considering the scatter that is obtained from the 24 mock catalogs. An 
exception are the richest groups at high redshift (estimated to be more massive than 10 13 5 M Q ), for 
which the measured bias is significantly larger than for any of the 24 mock catalogs (corresponding 
to a 3ct effect), which is attributed to the extremely large structure that is present in the COSMOS 
field at z ~ 0.7. Our results are in general agreement with previous studies that reported unusually 
strong clustering in the COSMOS field. 

Key words: cosmology: observations - galaxies: clusters: general - galaxies: groups: general - galaxies: 
statistics - large-scale structure of universe 



1. INTRODUCTION 

In the current ACDM paradigm of cosmic structure 
formation, dark matter (DM) halos are biased tracers of 
the underlying matter field. That is to say, the autocor- 
relation function £ (r, M) of halos of mass M is related 
to the DM linear correlation function £iinM as 

Z(r,M)=b 2 (M)ZUr) (1) 

with b(M ) being the linear bias, which is a monotonically 
increasing function with mass (Kaiser 1984; Bardeen 
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et al. 1986; Cole & Kaiser 1989; Mo & White 1996). 
Since both functions ^linM and b(M) are theoretically 
well understood for a given cosmology, measuring the 
correlation function for a sample of halos can be used 
for several applications. If the masses of the halos are 
known, it can yield constraints on the underlying cosmol- 
ogy. If the masses are not known, adopting the current 
favored cosmological model provides information on the 
typical mass of the halos. If constraints exist for both 
the mass and the cosmology by means of independent 
measurements, an analysis of the correlation function al- 
lows a consistency test within the current paradigm of 
structure formation in the universe. 
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In this paper we perform such a consistency test at 
rcdshift 0.2 < z < 0.8 by assuming the currently favored 
ACDM cosmology to be correct and comparing the re- 
sulting masses that we obtain from a correlation function 
analysis with other independent estimates of the masses 
of the halos. The sample of DM halos is given by the op- 
tical group catalog (Knobel et al. 2012, hereafter "K12") 
which was produced using ~ 16, 500 spectroscopic red- 
shifts (the so-called 20k sample) from the zCOSMOS- 
bright galaxy survey (Lilly et al. 2007, 2009, S.J. Lilly 
et al. 2012, in preparation). In this context, a group is 
defined as a set of galaxies occupying the same DM halo 
and the zCOSMOS group catalog was constructed using 
a group-finding algorithm that was tuned by comparison 
to extensive mock galaxy catalogs from the Millennium 
simulation (Kitzbichler & White 2007). The operational 
definition of a DM halo in the Millennium simulation is 
of a friends-of-friends (FOF) group of DM particles con- 
nected with a linking length of b = 0.2 times mean in- 
terparticle separation, corresponding to structures with 
a mean overdensity of roughly 200 (see e.g. More et al. 
2011 for a recent discussion). The success of the group- 
finder in terms of the purity and completeness of the 
resulting catalog is derived by comparison with these 
same simulations for which the halo membership is of 
course known. The high purity of the zCOSMOS cat- 
alog guarantees that contaminations from fragmented, 
over-merged, and spurious groups should be small. 

Unfortunately, it is difficult to directly obtain esti- 
mates of the DM masses of the group halos from the 
optical data, e.g. by means of the virial theorem, since 
most of the groups have just a few identified members. 
For this reason, we introduced an estimated mass that 
was based on the observed richness of the group, cor- 
rected for variations in the spatial sampling rate (SSR) 
of the galaxies, and calibrated against the same simula- 
tions. We referred to this estimated mass as the "fudge 
mass" . The aim of this paper is to examine whether the 
clustering properties of the groups are consistent with 
them having these masses in reality. 

The most straightforward way to perform the consis- 
tency test would be by directly estimating the autocor- 
relation function of the groups for a given mass range. 
However, this would require detailed knowledge of the 
spatial selection function of the groups, which will likely 
depend on the observed richness. Clearly a poor group 
will suffer more from any spatial variation in the spec- 
troscopic sampling than a rich one, which will have been 
recognized even if several of its members were missed. 
For this reason, it is preferable to measure the cross- 
correlation function between groups and galaxies instead, 
i.e. the group-galaxy cross-correlation function. In this 
case, only the well understood spatial selection function 
of the 20k galaxy sample is needed and we can avoid wor- 
rying about the more complex spatial selection function 
of the groups. 

In this paper, we measure the group-galaxy cross- 
correlation function and the galaxy autocorrelation func- 
tion to perform a consistency test by estimating the 
group bias in the linear regime and comparing the re- 
sults to our richness-calibrated masses. This analysis 
should not depend on the choice of the "galaxy" sample, 
since the bias of this sample drops out from the anal- 
ysis. We check this by carrying out the analysis with 



both magnitude- and volume-limited galaxy samples ob- 
taining consistent results. Furthermore, we will perform 
the analysis in parallel on simulated mock catalogs in or- 
der to test our codes, explore systematics, and to obtain 
an idea of the impact of cosmic variance. Not least, we 
can explore whether the large COSMOS field is consis- 
tent with the predictions of the Millennium simulations 
and/or whether it is representative of other regions of 
sky. This was previously investigated in a couple of stud- 
ies (McCracken et al. 2007; Meneux et al. 2009; Kovac 
et al. 2010; de la Torre et al. 2010, 2011) with the result 
that the clustering in the COSMOS field is unusually 
strong compared with simulations and other surveys. 

The group-galaxy cross-correlation function was first 
measured by Seldner & Peebles (1977) for Abell clusters. 
In the last decade, it was measured in the local universe 
for 2dfGRS and SDSS group-galaxy samples (Yang et al. 
2005; Berlind et al. 2006; Mountrichas & Shanks 2007; 
Wang et al. 2008) and for DEEP2 at z ~ 1 (Coil et al. 
2006). 

This paper is organized as follows. In Section 2, we 
briefly review the halo model in the linear regime, which 
is the theoretical basis of this paper. In Section 3, we 
describe the group and galaxy samples that are used in 
the analysis. The method of the correlation function es- 
timation is described in Section 4 and in Section 5 we 
discuss the resulting correlation functions for the actual 
data and for the mock catalogs. In Section 6, we derive 
the group masses by means of their bias and compare 
our results with the mock catalogs. Section 7 concludes 
the paper and summarizes our findings. 

Where needed, the concordance cosmology of the 
mock catalogs is adopted, i.e. Hubble constant H = 
100ft, km s _1 Mpc -1 with h = 0.73, matter density 
fi TO = 0.25, cosmological constant ft\ = 0.75, spectral 
index n s = 1, and linear fluctuation strength a$ = 0.9. 
Although this value of erg = 0.9 is now thought to be a bit 
high (e.g. Komatsu et al. 2011), it should be remembered 
that the point of the paper is to check the consistency 
with the estimates of halo mass from K12 that were cal- 
ibrated using these same simulations. Also, the effect of 
(is on £(r, M) and on b(M) goes in opposite directions 
(see the discussion in Sect. 6.3). Throughout this paper, 
we refer to distances in comoving hr x Mpc and to masses 
in units of log(A//Af Q ) explicitly assuming h = 0.73. We 
use the term "dex" to express the antilogarithm, i.e. 0.1 
dex corresponds to a factor 10 01 ~ 1.259. 

2. THE HALO MODEL 

The principle of using clustering properties of partic- 
ular objects to infer their DM halo masses is well estab- 
lished in terms of the "halo model" (Peacock & Smith 
2000, Seljak 2000, see e.g. Cooray & Sheth 2002 for a re- 
view), and is only briefly reviewed here. The halo model 
is based on the following assumptions: first, all galaxies 
reside within DM halos following a certain spherical sym- 
metric density profile, where there is always a galaxy at 
the center of the halo. Second, the distribution p(N\M) 
of the number of galaxies in halos of mass M, which is 
called "halo occupation distribution" (HOD), depends 
for a given galaxy sample only on the mass M of the 
halo. Typical further assumptions are that the galaxy 
density profile within halos follows that of the DM and 
that central and satellite galaxies within halos constitute 
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different galaxy populations. For the analysis in this pa- 
per, we only need the assumptions that galaxies reside in 
halos and that the bias of the galaxy population, which 
comes from the HOD, does not vary greatly within a 
rcdshift bin. 

For two samples of galaxies g and g' with comov- 
ing number densities n g (x) and n g '(x), respectively, the 
overdensity for either sample is 



Si(x) 



m(x) 



i e {9,9'} 



(2) 



with hi = (ni(x)). The brackets () denote the cosmic av- 
erage which is taken to be the volume average by assum- 
ing ergodicity or a "fair sample hypothesis" for the cor- 
responding density field (see e.g. the discussion in Watts 
& Coles 2003, Sect. 4.3). Since zCOSMOS is too small 
to constitute a "fair sample" especially at small redshift, 
the corresponding averages are affected by cosmic vari- 
ance. 

The cross-correlation function £ gg ' (r) between these 
two samples g and g' is defined by 



£ gg ,(r) = (6 g (x)5 g ,(x')) = 



(n g (x)n g ,(x')) 
n g n g > 



- 1 



(3) 



with r = \x' — x\ and is a measure of the excess of gg' 
pairs at separations r compared to their mean number 
densities. The reason that £ gg < (r) depends only on the 
separation r = |a?2 — Xi\ is a consequence of the basic 
cosmological assumption of homogeneity and isotropy. 

From the definition of the correlation function and the 
assumption that all galaxies reside within DM halos, it 
follows immediately that the cross-correlation function 
divides into two terms, i.e. 



.(hi) 



(h2). 



(4) 



where the "one-halo term" Q g , contains the contribu- 
tion from galaxy pairs within the same halo and the 
"two-halo term" £ g g/ from pairs within different halos. 
For scales which are much larger than the typical exten- 
sion of a halo (< 1 Mpc) the one-halo term can be 
neglected and the correlation function becomes approxi- 
mately equal to the two-halo term. In this paper, we will 
consider only the linear regime where the one-halo term 
is negligible. 
The two-halo term can be approximated by 



(5) 



where we have introduced the linear bias b g and b g > for 
the two galaxy samples, respectively, being defined as 

d ^(M) K M)^l d M (6) 

d ^(M )b (M)^m d M (7) 
dm riff/ 



b g ,= 



with dhh/dM being the mass function of the DM halos 
and (Ni\M), i = {g,g'}, the mean number of galaxies in 
halos of mass M determined by the corresponding HOD. 
In Eq. (5) we have neglected the extensions of the halos 
and formally just placed all galaxies at the centers of the 



corresponding halos, which is a good approximation for 
scales much larger than the extension of the halos. 

The specific expressions for the galaxy ( u gg" ) autocor- 
relation and the group-galaxy ("Gg") cross-correlation 
function, which we will need for our analysis, are imme- 
diately obtained by means of Eqs. (5)- (7) as 



with 



£gg( r ) - & g &n(r) 
£ Gg (r) ^ b G b g £u„(r) , 



f G (M)^(M)b(M)dM 
f G (M)^(M)dM 



(8) 
0) 

(10) 
(11) 



where /q(M) is the completeness of the group sample 
with respect to the total halo population. 

3. DATA 

In this section, we describe the data that were used 
for the analysis in this paper. We summarize in turn 
the zCOSMOS survey from which the data are taken, 
the properties of the zCOSMOS group catalog, and the 
construction of realistic mock data samples. Finally, we 
describe the group and galaxy samples that were adopted 
for our correlation function analysis. 

3.1. zCOSMOS survey 

zCOSMOS (Lilly et al. 2007, 2009, Lilly et al. 2012 
in preparation) is a deep spectroscopic galaxy survey on 
the 1.7 deg 2 of the COSMOS field (Scoville et al. 2007) 
which utilized about 600 hr of ESO Very Large Tele- 
scope service mode observations. It is divided up into 
two parts, "zCOSMOS-bright" and "zCOSMOS-deep" , 
whereby this work is entirely based on the bright part, 
which is now complete and contains spectra in the red- 
shift range 0.1 < z < 1.4 for about 20,000 objects taken 
using the VIMOS spectrograph. 

The target selection in zCOSMOS-bright is essentially 
magnitude limited by 15 < Iab < 22.5. The slits were 
assigned to the targets such that for each mask the num- 
ber of slit assignments on each of the four VIMOS quad- 
rants was maximized except for some X-ray and radio 
objects which were observed at high priority. Since there 
were two masks per pointing and the pointings were over- 
lapping with centers differing by the size of a quadrant, 
there were finally eight passes for the central field, four 
at the borders, and two at the corners. About 2% of 
these spectra were taken for secondary objects, i.e. ob- 
jects that were not target objects but serendipitously 
ended up in the slits. They are not only very help- 
ful for estimating the accuracy and verification rate of 
rcdshifts, but also compensate for the bias against close 
pairs due to slit constraints (de Ravel et al. 2011; Kam- 
pczyk et al. 2011). After removing less reliable redshifts 
(i.e. confidence classes 0, 1.1, 2.1 and 9.1; see Lilly et al. 
2009) and spectroscopic stars, we end up with a high- 
quality rcdshift galaxy sample containing about 16,800 
objects within the area 149.47° < a < 150.77° and 
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Figure 1. Number of galaxies and groups as a function of red- 
shift. Upper panel: distribution of zCOSMOS galaxies in the cen- 
tral area. The blue line shows the smoothed distribution that was 
used for the production of the "randoms" (see the text). Lower 
panel: distribution of the zCOSMOS groups that were used for the 
cross-correlation analysis. The histograms correspond to different 
mass bins as indicated by the labels in units of log(M/MQ). The 
groups in the range 12 < log(M /Mq) < 12.5 are omitted for clar- 
ity. The three dotted vertical lines mark the two adopted redshift 
bins. 

1-62° < 5 < 2.83°. From multiply observed objects the 
spectral verification rate for this sample is about 99% 
and the redshift accuracy about 100 km s _1 . 

The spatial sampling rate (SSR), i.e. the fraction of 
objects of the magnitude-limited target catalog whose 
spectra were observed as a function of (a, 5), is shown in 
Figure 1 of K12. In the design of zCOSMOS there is a 
central region (a = 150.12 ± 0.54° and 5 = 2.22 ± 0.46°) 
with substantially higher SSR than in a region around 
the borders, and we will restrict our analysis to this more 
highly sampled region. However, even within this cen- 
tral region, the SSR is not completely uniform, but ex- 
hibits some stripes due to the placement of slits in the 
masks. It is obviously very important to take these into 
account especially on the smaller scales. Additionally to 
the SSR we have to consider the redshift success rate 
(RSR), which is the fraction of observed spectra yielding 
successfully measured redshifts (see Figs. 2 and 3 of Lilly 
et al. 2009), since this affects the redshift distribution 
N(z) of the galaxy sample. The SSR and RSR can be 
assumed to be uncorrelated so that by multiplying them 
we obtain for each galaxy the sampling rate with respect 
to an ideal survey. The central area of zCOSMOS-bright 
has a mean completeness of 56%. 

The distribution of zCOSMOS galaxies in redshift is 
shown in Figure 1. There are two prominent features at 
redshifts ^3.5 and ~0.7. The large overdensity at high 
redshift is additionally contrasted by strong undcrdcn- 
sities at z ~ 0.6 and z ~ 0.8 making the structure at 
z ~ 0.7 dominant over the redshift range 0.5 < z < 0.8. 

3.2. Group catalog 

The zCOSMOS 20k group catalog is described in detail 
in K12. The basic catalog was produced by a Friends-of- 
Friends (FOF) multi-run scheme (see Knobel et al. 2009) 
in which we successively used different group-finding pa- 



rameters, optimized for different richness groups. Based 
on realistic mock catalogs, the completeness and purity 
of groups with three or more observed members are both 
about 83% with respect to all groups that should have 
been detectable within the survey. More than 75% of the 
cataloged groups should exhibit a one-to-one correspon- 
dence to a real group in the sky (see Figs. 3 and 5 of 
K12). 

The purity of the group catalog can be further en- 
hanced by selecting only the groups that are also in- 
dependently detected by the Voronoi-Delaunay method 
(VDM) . The set of FOF groups that are independently 
identified by VDM such that the corresponding recon- 
structed FOF and VDM groups exhibit a one-to-one cor- 
respondence (or two-way match, "2WM") is called the 
"GRP2 subcatalog" , which we will use for our analysis. 
Its purity is ~ 85% for groups with at least three observed 
members, but the completeness is lower. 

Obtaining reliable observational estimates for the dy- 
namical masses of individual groups is hard. This is why 
we introduced the "fudge masses" Mf u d gc , which are mass 
estimates based on the richness of the groups and cali- 
brated against the mock catalogs. That is, for a given 
group at redshift z we correct the observed spectroscopic 
richness for the SSR and RSR and assign it the average 
real halo mass of 2WM mock groups (see below) with 
the same corrected richness at the same redshift. The 
scatter between the fudge masses and the masses of the 
corresponding haloes in the mock catalogs is about 0.3 
dex for mock groups with three members and drops to 
about 0.15 dex for mock groups with more than 10 mem- 
bers (see Tab. 7 of K12). The analysis in this paper aims 
to relate the fudge masses to the clustering strength of 
the corresponding groups. 

3.3. Mock catalogs 

We utilize the same mock catalogs that were used in 
K12 to produce the zCOSMOS group catalog, and for 
details we refer to that paper. The mock catalogs are 
adapted from the COSMOS mock light cones (Kitzbich- 
lcr & White 2007) which are based on the Millennium 
DM iV-body simulation (Springel et al. 2005) run with 
the cosmological parameters Vl m = 0.25, fl\ = 0.75, 
tt b = 0.045, h = 0.73, n s = 1 and <r 8 = 0.9. The 
semi-analytic recipes for populating the DM halos with 
galaxies are that of Croton et al. (2006) as updated by De 
Lucia & Blaizot (2007). There are 24 essentially indepen- 
dent light cones, each covering an area of 1.4 degx 1.4 deg 
with an apparent magnitude limit of r < 26 and a red- 
shift range of z < 7. 

The mock catalogs were created to resemble as much 
as possible the actual 20k sample. That is, we applied 
the SSR and RSR of the 20k sample to the light cones 
by randomly removing galaxies from the magnitude lim- 
ited sample according to their completeness, and we in- 
troduced a Gaussian redshift measurement error with 
a z = 100(1 + z)/c km s _1 for each galaxy. 

The mock group catalogs were then produced in ex- 
actly the same way as the actual group catalog was 
produced from the zCOSMOS data (see K12). Since 
within the simulations we know which galaxies belong to 
which groups, we make the differentiation between "real 
groups" in the mock catalogs and the "reconstructed 
groups" that are identified by the groupfinder. Rccon- 
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structed mock groups with a 2WM to real mock groups 
(i.e. groups that are properly identified) are called "2WM 
groups". The statistical properties, such as purity and 
completeness, of the reconstructed mock groups, as de- 
termined by comparisons with the real mock groups, are 
then assumed to be indicative of the statistical properties 
of the actual identified groups in the zCOSMOS data. 

3.4. Data samples 

We restrict the analysis in this paper to the central re- 
gion of the zCOSMOS field which contains about 13,000 
galaxies and has a mean completeness of 56% with re- 
spect to a fully magnitude-limited survey. The central 
region is also where most of the groups are found, where 
they have the highest quality, and where they are the 
least affected by the border of the survey. We consider 
in the following two redshift bins: one at high redshift 
0.5 < z < 0.8 and one at low redshift 0.2 < z < 0.5. 

It should be noted that the transverse size of the sur- 
vey is much larger for the high-redshift bin than for the 
low-redshift bin. At z = 0.5 the transverse size of the 
survey along the declination A<5 = 0.92° is about 22 
Mpc, while at redshift z = 0.2 it is only about 9 h^ 1 
Mpc. These lengths define the largest transverse scales 
for which we can hope to reliably measure the correla- 
tion function. Due to the larger transverse size at high 
redshift, the total volume of the survey is substantially 
larger resulting in a higher number of massive groups 
compared with the low-redshift bin. 

Since the completeness of the group samples is not cru- 
cial for our analysis, we optimize our group sample for 
purity by adopting the GRP2 subcatalog including all 
groups with at least three observed members. At high 
redshift, we divide the group population into bins of 0.5 
dex in fudge mass Mf u d go - At low redshift, the correla- 
tion signal is much more difficult to measure due to the 
relatively small volume. For this reason to have enough 
groups in each mass bin we divide the group population 
into two mass bin separated by Mf u d g0 = 10 13 i\/©. The 
resulting group samples are summarized in Table 1 and 
the redshift distribution of the groups is shown in Figure 
1. 

To cross-correlate the groups with the galaxies, we use 
the zCOSMOS 20k galaxy sample in the same area and 
in the same redshift bins. To check that the estimated 
bias of the groups does not depend on the adopted galaxy 
sample, we produce a magnitude-limited and a volume- 
limited galaxy sample for each redshift bin. The volume- 
limited samples are obtained by selecting the galaxies in 
absolute magnitude Mb as 

M b < M b j im - z , (12) 

where the redshift z is to take into account, at least ap- 
proximately, the general evolution of galaxy luminosity 
since redshift z ~ 1. The absolute magnitude limit for 
the high-redshift bin is Mbji m = —20 and for the low- 
redshift bin Mbji m = —19. The galaxy samples are sum- 
marized in Table 2. 

4. CORRELATION FUNCTION ESTIMATION 

In comoving space the correlation function is just a 
function of the separation r = \x2 — x\\. However, in 
redshift space s the peculiar velocities of groups and 



Table 1 

Group samples 



0.5 < z < 0.8 



0.2 < z < 0.5 



Mass bins 12.5-13 13-13.5 >13.5 < 13 > 13 



No. of groups 



53 



61 



21 



110 



60 



Note. The masses are in units of log(M/M0). 

galaxies distort the correlation function along the line of 
sight, while perpendicular to the line of sight it remains 
unaffected. Thus we have to deal with the "projected 
correlation function" 



w(r p ) 



dsn 



£(ir,r p )dii, (13) 



where we set tt = and r p = \s±\ for the decompo- 
sition s = sii + sj_ of a rcdshift-space vector s into its 
components along and perpendicular to the line of sight, 
respectively. We will refer to both £ and w as the "corre- 
lation function" if it is clear from the context which one 
is meant. 

To estimate the autocorrelation function for a galaxy 
sample, the number of pairs for a given separation is 
compared to the corresponding number of pairs r and 
r' for an uncorrelated random catalog (the "randoms"), 
which is subjected to exactly the same observational se- 
lection function as the actual galaxy sample. We use the 
estimator given by Landy & Szalay (1993) 

N gg n~ 2 + N rl h- 2 - N^n^n- 1 

Cbk(Ti»p) = — — , (14) 

N„n r 

where N gg is the number of galaxy-galaxy pairs (gg), iV rr 
is the number of random-random pairs (rr) 1 and N gT is 
the number of galaxy-random pairs (gr) each time with 
separations in the ranges [7r, tt + dir] and [r p ,r p + dr p ]. 
Note that only distinct pairs are counted. 

For the cross-correlation estimation we could easily 
generalize this same estimator (14). However, it contains 
a term N rr which would require us to produce also a ran- 
dom catalog for the groups. Since the selection function 
for the groups is expected to be more complicated than 
that for the galaxies (e.g. depending on their richnesses), 
we want to avoid producing a random catalog for the 
groups. For this reason we use a generalization of the 
estimator given by Peebles (1973; see also Yang et al. 
2005; Wang et al. 2008) 



Gg 



N Gr n g 



- ^ 1 



(15) 



where N Gg is the number of group-galaxy pairs (Gg) and 
Nqt is the number of group-random pairs (Gr) each time 
for separations in the ranges [tt , 7r + dn] and [r p , r p + dr p ] . 

The randoms were distributed according to the SSR 
using the same selection function as for the produc- 
tion of the mock catalogs, which has a resolution of 
1.5 arcmin. The scales affected by this resolution are 
r p IS 1 Mpc even at the highest redshifts considered. 
The redshift distribution of the randoms was taken to 
be the (normalized) product of dN g (z)/dz, which is the 
mean number of galaxies per redshift for either an ideal 
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Table 2 

Galaxy samples 







Magnitude-limited 


Volume-limited 






Selection No. of galaxies 


Selection No. of galaxies 


0.2 < 


2 < 0.5 


I A b < 22.5 4712 


M B (z) = -19 -z 2666 


0.5 < 


z < 0.8 


J AB < 22.5 4445 


M B (z) = -20 -2 2212 



magnitude- or volume-limited sample, and the RSR (see 
blue line in Fig. 1 for the magnitude-limited case). For 
the magnitude- limited sample we estimated dN g (z) / dz 
from the Iab < 22.5 magnitude-limited target catalog by 
means of a Vmax like method using photometric rcdshifts 
and for the volume-limited sample we made the approx- 
imation dN g {z)/dz (x dV(z)/dz with V(z) the volume of 
the survey up to redshift z. We checked that the esti- 
mated bias neither depends sensitively on the resolution 
of the SSR mask nor on the details of the chosen redshift 
distribution. 

For each pair of points (i.e. gg, gr, rr, Gg, and Gr) 
we estimated the separation by tt = \d± — d%\ and 
r p = (d\ + cfe) tan(f?/2), where d\ and d,2 are the co- 
moving distances to the points and 9 their relative an- 
gle projected on the sky (Davis & Peebles 1983). The 
binning for £ gg (and analog for £cg) was taken to be 
logarithmic in both coordinates and the projected cor- 
relation functions w gg were estimated by approximating 
the integration along it by a sum over 13 logarithmic bins 
ranging from 0.1 hr 1 Mpc to 13 hr 1 Mpc. The effect of 
the choice of the integration range is discussed in Section 
6.2. We also checked that our result is insensitive to the 
implemented binning and integration. 

The error bars for the correlation function arc com- 
puted by the scatter of the correlation functions of the 
24 mock catalogs. It was shown that the error bars of 
the gg correlation function estimated by blockwise boot- 
strapping are of the same order as the scatter among the 
mock catalogs (Meneux et al. 2009; de la Torre et al. 
2010). For the Gg cross-correlation functions the group 
samples would be too small for blockwise bootstrapping 
and this procedure would anyway not take into account 
cosmic variance, which is not negligible even in the large 
COSMOS field. 

5. THE CORRELATION FUNCTIONS 

The measured correlation functions arc shown in Fig- 
ures 2-6. We have estimated the correlation functions 
for the magnitude- and volume-limited samples in both 
redshift bins for the zCOSMOS 20k sample and for each 
of the 24 mock catalogs. In the following, we first discuss 
the results for the mock catalogs to understand how the 
group selection affects the correlation function estimates 
and then we discuss the results for the actual data. 

We show only the correlation functions for the 
magnitude-limited samples. The correlation functions 
for the volume-limited galaxy samples are qualitatively 
similar, but with slightly higher amplitude, since brighter 
galaxies reside predominantly in more massive halos, 
which arc more biased. The estimated bias is however 
provided for both samples in the next section. 
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Figure 2. Autocorrelation function for the magnitude limited 
galaxy samples for the two redshift bins. The red line shows the 
20k data and the black solid line shows the mean of the 24 mock 
correlation functions, where the error bars indicate their standard 
deviation. The dashed black line shows the linear correlation func- 
tion model with adjusted normalization. For comparison the cor- 
relation function for a particular mock is also shown (green line). 
The vertical dashed line marks the smallest transverse box size for 
the corresponding redshift bin. 

5.1. Mock correlation functions 

The gg correlation functions for the mock catalogs are 
shown in Figure 2. The black solid line is the mean of the 
24 mock correlation functions, where the error bars show 
the standard deviation among the 24 mock catalogs. The 
dashed black line is the linear correlation function for the 
cosmology of the mock catalogs, where we adjusted the 
normalization to that of the measured correlation func- 
tion. It is obvious that for scales >3 hr 1 Mpc we are in 
the linear regime and the mean of the estimated correla- 
tion functions follow very well the linear model. The ver- 
tical dashed line marks the minimal transverse box size 
of the survey for the corresponding redshift bin. This is 
the scale where we expect our estimation to fail since we 
cannot measure density fluctuations on scales compara- 
ble to the size of the survey. Beyond this line the error 
bars blow up and the different curves start to diverge 
from each other. This effect is even more pronounced for 
the Gg correlation functions. 

Figures 3 and 4 show the mock Gg correlation functions 
for the high- and the low-redshift bin, respectively. We 
consider several group samples, which are all sub-samples 
of the corresponding GRP2 mock group catalogs with 
at least three observed members, and for each of these 
samples we show the mean of the 24 corresponding mock 
correlation functions. These figures illustrate how the 
estimated correlation functions depend on the selection 
of groups. The behaviors of the different curves in both 
redshift bins are qualitatively similar and the main trends 
can be summarized as follows. 

The black solid curve corresponds to the group-galaxy 
cross-correlation functions of the real groups in the mock 
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Figure 3. Group-galaxy cross-correlation functions for several mock group samples at 0.5 < z < 0.8 using the magnitude-limited galaxy 
sample. The vertical panels correspond to different mass bins as indicated by the labels in units of log(M/M0). In the upper panels, the 
curves (except dash-dotted) show the mean of the 24 mock correlation functions for different group samples, which are subsamples of the 
GRP2 subcatalog with at least three observed members. The black solid line corresponds to real groups (i.e. real masses and real centers), 
the dashed line to reconstructed 2WM groups (i.e. fudge mass and estimated centers), the red solid line to all reconstructed groups, and 
the dotted line to the spurious groups. For comparison the dash-dotted line shows the linear correlation function model with adjusted 
normalization. In the lower panels, the relative difference of the correlation functions to the correlation function of the real groups (black 
solid line) is shown. The shaded area always corresponds to the standard deviation among the 24 samples of the real groups and the vertical 
dashed line marks the smallest transverse box size for this redshift bin. 
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Figure 4. Group-galaxy cross-correlation function for several 
mock group samples at 0.2 < z < 0.5 using the magnitude-limited 
galaxy sample. The different panels are for different mass bins as 
indicated by the labels in units of \oi>(M/Mq). The lines have the 
same meaning as in Figure 3. 

(i.e. using real group positions and real halo masses 
within the mock catalogs) and constitutes an ideal group 
sample. Like for the gg correlation functions they show 
a fair agreement to the slope of the linear correlation 



function (dash-dotted line) for scales >3/i _1 Mpc. The 
dashed black curve is the cross-correlation function for 
the reconstructed 2WM mock groups (i.e. using the es- 
timated group centers and their "fudge" masses). Since 
the estimates for the group centers feature a typical off- 
set of ~0.1 /i _1 (l + z) Mpc from the real group centers 
(see Fig. 18 of K12), they do not affect our results in the 
linear regime. The differences between the solid and the 
dashed curves are of the order 5% in the linear regime, 
and the selection by fudge mass for 2WM groups basi- 
cally has a negligible impact on the correlation function 
compared with the error bars. 

We now look at how the non-2WM groups (i.e. the frag- 
mented, overmerged, and spurious groups) affect the cor- 
relation function estimates. The red solid line shows the 
correlation function for all reconstructed groups within 
the corresponding (fudge) mass range within the mock 
catalogs, which can be compared to the correlation func- 
tions for the actual data. They are typically about 20% 
higher than the real group correlation function (given by 
the black solid line), but with the right slope on average. 
The differences to the correlation function of the 2WM 
groups (black dashed line) must be caused by the non- 
2WM groups. It can be seen that the cross-correlation 
function for the spurious groups (dotted lines) are typ- 
ically significant higher than the other correlation func- 
tions suggesting that these spurious "groups" are being 



Knobel et al. 



"identified" by the groupfinder in high-density environ- 
ments which increases the cross-correlation of these spu- 
rious groups with the galaxies. Note that there is no 
dotted line shown for the highest mass bins, since there 
are essentially no spurious groups in our mock samples 
at these masses. 

It is worth noting that the upturn in the cross- 
correlation function in the nonlinear regime is much more 
pronounced for the highest mass groups in the mock cat- 
alogs. This is due to the strength of the "one-halo" term 
for the richer groups. 

5.2. zCOSMOS cross- correlation functions 

The gg-correlation functions for the actual data are 
shown in Figure 2 (red lines). For both redshift bins 
they follow a nice power law, but their slopes are shal- 
lower than those for the mock catalogs. This was already 
pointed out by Meneux et al. (2009) and de la Torre et al. 
(2011) for the zCOSMOS 10k sample by comparing their 
results to correlation functions from VVDS and SDSS. 
In a further paper de la Torre et al. (2010) suggested 
that the change in slope might be caused by the 10% 
of galaxies residing in the most dense environments. By 
excluding this 10% of high-density environment galaxies, 
they finally obtained consistent results. 

We will follow a different route here and discuss the 
question whether the real 20k sample could statistically 
be regarded as just another mock realization with respect 
to the clustering of galaxies. The green lines in Figure 
2 each correspond to one of the 24 mock catalogs whose 
correlation function is relatively close to that of the real 
data. Although both green lines are by no means typical 
for the mock catalogs as a whole (there are about one 
or two like this in each redshift bin, from the set of 24 
mock catalogs) they show that the amplitude as well as 
the slope of the correlation functions for the actual data 
could be due to cosmic variance with a probability of 
5%-10% corresponding to a 2a effect. Compared to the 
mock catalogs, the COSMOS field is unusual but not 
exceptional. 

The zCOSMOS Gg cross-correlation functions are 
shown in Figures 5 and 6. The error bars correspond 
to the standard deviation among the 24 mock catalogs. 
Most of the observed Gg correlation functions follow 
nicely the linear correlation function model wn n (dashed 
line, see next section) in the linear regime (filled points). 
For comparison also the mean Gg correlation functions 
for all reconstructed mock groups are shown (black solid 
line, transposed from the red lines in Figs. 3 and 4). 
While for the two lower mass bins at high redshift, 
the cross-correlation function is in reasonable agreement 
with that found in the mock catalogs, the discrepancy is 
quite large for the highest mass bin. We will return to 
this point later. 

Interestingly, the agreement between the correlation 
functions for the actual data (for the autocorrelation 
functions as well as cross-correlation functions) and for 
the mock catalogs is very good at small separations for 
all mass bins (including the highest mass bin at high red- 
shift). This is particularly clear in Figure 6, in which a 
significant detection of the one-halo term is visible for 
the higher mass bin. The excess in the correlation func- 
tion for the actual data, relative to the mock catalogs, 
occurs at large scales in the linear regime. 
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Figure 5. Group-galaxy cross-correlation functions for the zCOS- 
MOS 20k sample (red points) at 0.5 < z < 0.8. The different panels 
correspond to different mass bins as indicated by the labels in units 
of \og(M / Mq). The filled points mark the linear regime and were 
used for the estimation of the bias. The dashed curve shows the 
linear model uii; n with fitted amplitude and the solid black curve 
shows the mean of the 24 mock cross-correlation functions (i.e. red 
lines in Fig. 3). The error bars indicate the standard deviation of 
the correlation function of the 24 mock catalogs and the vertical 
dashed line marks the smallest transverse box size for the corre- 
sponding redshift bin. 
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Figure 6. Group-galaxy cross-correlation functions for the zCOS- 
MOS 20k sample (red points) at 0.2 < z < 0.5. The different panels 
correspond to different mass bins as indicated by the labels in units 
of \og(M /Mq). The filled points mark the linear regime and were 
used for the estimation of the bias. The dashed curve shows the 
linear model uii; n with fitted amplitude and the solid black curve 
shows the mean of the 24 mock cross-correlation functions (i.e. red 
lines in Fig. 4). The error bars indicate the standard deviation of 
the correlation function of the 24 mock catalogs and the vertical 
dashed line marks the smallest transverse box size for the corre- 
sponding redshift bin. 



6. MASS ESTIMATION 

In this section, we present the estimated bias for the 
groups. We first describe the method of estimation, then 
we perform a consistency test using the mock catalogs to 
explore possible systcmatics, and finally we discuss the 
measured masses from the bias in the actual data. 

6.1. Estimation method 

To estimate the linear group bias 6q we compare the 
estimated correlation functions to a model correlation 
function, which is a scaled version of the projected linear 
correlation function w;ii n for the cosmology of the mock 
catalogs. To compute it we use the linear power spectrum 



V Vln (k,z) = Ak n °T 2 (k)D 2 (z) , 



(16) 



with T(k) being the transfer function, D(z) being the lin- 
ear growth function, n s being the spectral index, and A 
being the normalization constant depending on o~g. We 
take the fitting formula for the transfer function from 
Eisenstein & Hu (1999) using the iCosmo software pack- 
age 23 (Refregier et al. 2011). The linear correlation func- 
tion £ii n is then obtained by the Fourier transformation 



1 



(2tt) 3 
1 



V na (k,z) e lkr dk 3 



, 9 ^ /, \ sin(fcr) „ 
k 2 V iin (k,z)^^dk 
Z7r z J kr 



(17) 
(18) 



and the corresponding projected correlation function by 
integration along ir 



Wlin(r p ,z)=2 Clin^TT 2 + r 2 , , 



G?7T 



(19) 



where the integration limits are the same as for the esti- 
mated correlation functions, i.e. 7r m ; n = 0.1 hr x Mpc and 

TTmax = 13 h^ 1 MpC. 



With the estimated correlation functions and the theo- 
retical model at hand, we can estimate the group bias &g 
by eliminating the galaxy bias b g from the Eqs. (8) and 
(9). For each mass bin, the bias is estimated by adjust- 
ing the normalization of u>ii n (r p , z e g) to the data points 
in the linear regime by means of a least-square fit, where 
z e s is for either correlation function the pair-weighted 
effective redshift of the corresponding sample. That is, 
for the correlation function between the samples G (or 
g) and g the effective redshift is defined by 



Zcff 



(20) 



where the index i runs over all objects of the sample G 
(or g), Zi is the redshift of the ith object, and N pa ,i ISi i is 
the number of pairs between the two samples that are as- 
sociated to object i within our maximal correlation scale. 
We weight the redshift (and other quantities related to 
a sample as a whole) by the number of pairs to account 
for the variation in the number of groups and galaxies 
with redshift, as the correlation function estimates are 
also pair weighted (see Eqs. (14) and (15)). The linear 
regime is marked by the filled points in the Figures 5 and 
6. Since the galaxy bias b g is eliminated, the resulting 
group bias 6q should be independent of the choice of the 
galaxy sample. 

Finally, we compute the mass Mb for the measured bias 
6q using the fitting formula from Tinker et al. (2010), 
which has an accuracy of about 6%. That is, we used 
the expression for b{v) in their Eq. (6) for the parame- 
ters in their Table 2, which is parameterized in terms of 
v(M,z) = 6 c /cr(M,z), where 6 C ~ 1.686 is the critical 
linear density for a spherical collapse and <j(M, z) the 
linear matter variance for the mass M at redshift z, i.e. 



a(M,z) 



1 

2^2 



k 2 V Un {k,z)W 2 (k,M)dk (21) 



with W(k,M) being the Fourier transform for a top hat 
filter of scale R = (3M/47T/5) 1 / 3 , where p is the comoving 
mean matter density in the universe. This gives us the 
relation b(M) between the linear bias and the halo mass 
at a given redshift, which can be numerically inverted to 
yield the relation M(b). Thus, for a given mass M we 
can compute the corresponding bias b and vice versa. 

To compare the resulting mean masses Mb that cor- 
respond to the estimates be to the independent fudge 
masses of the groups we introduce an "effective fudge 
mass" M e g for a given group sample. We could just av- 
erage the fudge masses of the groups. However, follow- 
ing Eq. (11), it is more correct to average the values of 
the bias that would be expected for the individual fudge 
masses, and then convert this mean bias back to a mass. 
Thus, we define the effective fudge mass M e g as the mass 
that corresponds to the pair-weighted "effective bias" 



b c s = 



J2i b(M (udge:i )N p 



(22) 



pairs , % 



where the index i runs over all groups within the sample, 
Mf u dgo,i is the fudge mass for the ith group, and N pa _irs.i 
is the number of Gg pairs associated with group i. 



23 http://www.icosmo.org 
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Figure 7. Consistency test for the estimation of group bias using 
the mock catalogs. b e ff denotes the "true" effective bias computed 
by Eq. (22) for the real masses of 2WM mock subsamples (see 
the text) and bQ is the corresponding group bias obtained from 
the correlation function analysis. The gray areas show the la re- 
gions of the means of bQ and 6 c ff as estimated for each of the 24 
mock catalogs. The consistency between the two estimates for the 
magnitude- and volume-limited galaxy samples demonstrates the 
robustness of our method. 

6.2. Consistency test with the mock catalogs 

To test this method, we perform an important consis- 
tency test for the subsamples of the 2WM groups within 
our mock group samples. Since we know the real DM 
halo masses for these groups (in the mock catalogs), we 
can estimate their "true" effective bias using Eq. (22) by 
substituting the fudge masses by their real halo masses. 
The comparison between the group bias 6g from the 
correlation function analysis for these 2WM groups and 
their "true" effective bias b e g is shown in Figure 7 for 
all samples. The gray parallelograms represent the esti- 
mated lcr region for the mean of the 24 mock catalogs 
in each mass bin. They take into account the scatter of 
6g from the 24 mock correlation functions and also the 
scatter of b e g and are these standard deviations divided 
by root-24. The fact that the recovered mean bias &g hi 
the 24 mock catalogs is always close to the actual bias 
b e g (averaged over the range of redshift and mass etc.) 
suggests that systematic errors (e.g. due to the uncer- 
tainty in the b(M) relation) are well below the random 
uncertainties associated with a single mock catalog, or 
with a single COSMOS-like survey field. 

We also explored (again using the 2WM mock subsam- 
ples) how the estimated bias depends on the maximum 
integration limit 7r max for the different galaxy and group 
samples. For the high-redshift bin, the corresponding 
curve is shown in Figure 8. For the low-redshift bin (not 
shown) the trends are very similar. The curves corre- 
spond to the mean of the 24 mock catalogs and the error 
bars show the standard deviation of the mean. It turns 
out that the bias generally depends only weakly on the 
integration limit. The bias of the volume- limited sample 
(red) is systematically slightly higher than that of the 
magnitude-limited samples (blue) and approaches the ex- 
pected bias closer (this is also obvious in Fig. 7). Another 
feature is the slight decline of the bias for 7T max > 15 h~ x 
Mpc, particularly for lower mass bins. For this reason we 
chose a maximum integration limit which is lower than 
the 20/i _1 Mpc adopted in Meneux et al. (2009) and de la 
Torre et al. (2011). The difference in the estimated bias 
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Figure 8. Group bias bQ as a function of the maximum integra- 
tion limit 7r max for the 2 WM mock subsamples in the high redshift 
bin. The different panels correspond to different mass bins accord- 
ing to the labels in units of log( A*f/A/;.;). The red curve corresponds 
to the mean bias of the 24 mock catalogs of the volume-limited 
galaxy sample and the blue curve corresponds to the mean of the 
magnitude-limited sample. The error bars show the standard devi- 
ation of the mean and the gray horizontal bar the lcr region of the 
mean of the "true" effective bias b c g (see the text). The vertical 
dotted line marks the adopted integration limit for this paper. 



changes, however, only about 10% between these two in- 
tegration limits, which is small compared with the error 
bar of the bias for a single mock catalog which is a/24 
times larger than the error bars shown in the Figures 7 
and 8. 

6.3. Resulting masses and discussion 

The resulting mean masses Mb that correspond to the 
estimates be for our group samples are shown in Fig- 
ure 9 as a function of the effective fudge mass M e s de- 
fined by means of Eq. (22). The masses Mb along with 
6g for the zCOSMOS groups are provided in Table 3, 
where the masses and the bias assume as = 0.9, but 
the latter can be easily scaled to any value of as a s 
bc{o~8) = (0.9/<78)&g(0.9). On the other hand, the result- 
ing masses are relatively insensitive to as as the change 
in bc(as) is counteracted by the mass-bias relation for 
different as- We checked that for as = 0.8, as indicated 
by recent measurements (e.g. Komatsu et al. 2011), the 
resulting Mb would decrease by ~ 0.05-0.1 dex with a 
stronger decline for the higher masses. 

The red points in Figure 9 correspond to the zCOS- 
MOS 20k groups, while the black points show, for each 
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bin, the resulting masses for the 24 mock catalogs. The 
relatively low effective fudge mass for the actual data in 
the high- mass bin at low redshift is due to the lack of high 
richness groups relative to the COSMOS light cones (see 
K12 for a discussion). The overall similarity between the 
estimated masses for the magnitude- and volume-limited 
galaxy samples underscores the robustness of the method 
and the expected independence of the analysis on the de- 
tails of the galaxy sample (the magnitude- and volume- 
limited samples differ by a factor of about two in number, 
see Tab. 2). The difference between the estimated masses 
are only about 0.02 dex for the high-redshift range and 
<0.15 dex for the low redshift range (cf. Tab. 3). 

The linear bias b{M) is expected to be a strongly non- 
linear function of log(M/M©). At redshift z = 0.7, for 
instance, it sharply increases for M > 10 135 Mq and flat- 
tens for M < 10 12 M Q (e.g. Pillepich et al. 2010; Tinker 
et al. 2010). At lower redshift, these features move to 
slightly higher masses (i.e. the shift is about 0.5 dex un- 
til z = 0). These features in the b(M) curve will, for 
instance, produce non-Gaussian distribution of the in- 
dicated mass for a Gaussian observed uncertainty in b. 
For this reason, we plot the distributions of the indi- 
cated Mb for the 24 mock catalogs instead of error bars 
on the mean, which would be difficult to interpret. Not 
surprisingly (cf. Fig. 7), the measured bias &g for the 
mock catalogs are, nonetheless, quite symmetrically dis- 
tributed around the corresponding effective bias b e g de- 
rived from the fudge masses (see Eq. (22)), but are typi- 
cally 5%-10% higher than b e g. This is consistent with the 
systematic overcstimation of the correlation function for 
reconstructed mock groups compared to that for 2WM 
groups as shown in the Figures 3 and 4. 

The estimated mean masses Mb for the zCOSMOS 
groups are broadly within the scatter of the 24 mock 
catalogs. They increase with fudge mass as expected by 
the theory of cosmic structure formation. This effect was 
first clearly measured by Bahcall & Soneira (1983) using 
Abcll clusters and recently in SDSS in the low-rcdshift 
universe with a detailed comparison to the ACDM cos- 
mology (e.g. Berlind et al. 2006; Wang et al. 2008). The 
masses Mb for the zCOSMOS groups are, however, gen- 
erally larger than the average Mb for the mock catalogs 
reflecting the high amplitudes of the correlation functions 
in Figures 5 and 6 (i.e. the measured bias is typically 
about lcr-1.5(T higher than the corresponding mean bias 
in the 24 mock catalogs). An exceptionally large bias is 
measured for the highest mass bin at high redshift, this 
being 15% higher than the largest bias seen in the cor- 
responding 24 mock samples. This is an about 3a effect. 
Adopting a lower og of 0.8 would mitigate the discrep- 
ancy by decreasing the corresponding Mb for the actual 
data by ~ 0.1 dex, but it would still be higher than in any 
of the 24 mock catalogs. The reason for this unusually 
large bias measurement might be the huge structure at 
redshift z ~ 0.7 (sec Fig. 1) in the COSMOS field (see 
also the discussion in Meneux et al. 2009). In fact, of the 
24 groups in this mass bin, 19 lie in the redshift range 
0.65 < z < 0.75 and the remaining five groups are at 
z < 0.55. Thus, the corresponding correlation function 
is almost entirely dominated by the structure at z ~ 0.7. 

Thus, we conclude that in total our finding is essen- 
tially consistent with simulations. Although this analy- 



sis provides an important overall consistency check, the 
rather large error bars emphasize that these correlation 
functions do not provide precise estimates of the halo 
masses of particular tracers unless very large samples in 
very large fields are available. There are, however, in- 
dications that the structures observed in the COSMOS 
field correspond to rather rare realizations of the mock 
catalogs. The relatively high values of the bias reflecting 
the large amplitudes of the measured correlation func- 
tions are in general agreement with previous studies (Mc- 
Cracken et al. 2007; Meneux et al. 2009; Kovac et al. 
2010; de la Torre et al. 2010, 2011), which reported un- 
usually strong clustering in the COSMOS field. 

7. CONCLUSION 

We have performed a group-galaxy cross-correlation 
analysis in two redshift bins (i.e. 0.5 < z < 0.8 and 
0.2 < z < 0.5) using the high-quality zCOSMOS group 
sample cross-correlated with two spectroscopic zCOS- 
MOS galaxy samples. The aim was to perform a con- 
sistency test between the clustering strength of groups 
and their masses that had previously been estimated on 
the basis of the observed richness of the groups. To com- 
pute the group bias 6g w ^ measured the cross-correlation 
function between groups and galaxies and eliminated the 
galaxy bias 6 g by also measuring the galaxy autocor- 
relation function. The analysis was carried out using 
both magnitude- and volume-limited galaxy samples to 
demonstrate the robustness of our estimates. 

The mock catalogs are a valuable tool to explore the 
systematics of the methods and samples. A comparison 
of the cross-correlation functions for real mock groups 
and for all reconstructed mock groups shows that our 
group catalog is suited for a correlation function anal- 
ysis, but that we overestimate the correlation function 
for real groups by about 20% due to imperfections in the 
group catalog and in particular the spuriously identified 
groups. We also performed a consistency test with the 
mock catalogs to test our method and found largely con- 
sistent results between the clustering strength and the 
group masses. 

The resulting galaxy autocorrelation and group-galaxy 
cross-correlation functions for the actual 20k zCOSMOS 
group samples follow, as expected, approximate power 
laws in the linear regime which assures that the estimated 
bias is a meaningful quantity. At high redshift the ampli- 
tude of the galaxy autocorrelation function is rather high 
compared with the results from the mock catalogs and at 
both high and at low redshift its slope is unusually shal- 
low (for the high-redshift bin this was already noted in 
Meneux et al. 2009; de la Torre et al. 2010). We checked, 
however, that there are individual mock catalogs within 
our ensemble set of 24 which exhibit similar autocorrela- 
tion function amplitudes and slopes. This reassures that 
the zCOSMOS sample with respect to clustering can be 
regarded as a slightly untypical mock cone, where the 
peculiarities are simply due to cosmic variance. 

The measured bias for the zCOSMOS groups increases 
with group richness as expected by the theory of cosmic 
structure formation and yields masses that are reason- 
ably consistent with the masses derived from the richness 
alone, considering the scatter that is obtained from the 
24 mock catalogs. Only the bias for the highest mass bin 
at high redshift is significantly higher than seen in any 
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Figure 9. Estimated mean masses Mb from the correlation function analysis as a function of the effective fudge mass M e g within the 
corresponding mass bins (see Tab. 1). The red points show the 20k groups and the black points correspond to each of the 24 mock catalogs. 
The different mass bins are well separated from each other in terms of M c ff and the galaxy sample that was used is indicated in the panels. 
Equality of Mb and M e g is marked by the dashed line. 



Table 3 

Measured values of the bias b G and corresponding mean masses Mb for the zCOSMOS 



20k 


groups 






0.5 < z < 0.8 




0.2 < z 


< 0.5 


Mass bins 12.5-13 13-13.5 


>13.5 


<13 


>13 


Magnitude-limited galaxy samples 








b G 1.64 ±0.36 2.12 ±0.33 
M b 13.25 ±0.91 13.61 ±0.31 


4.16 ±0.48 
14.39 ±0.21 


1.52 ± 0.25 
13.45 ± 0.47 


1.97 ±0.28 
13.84 ±0.28 


Volume-limited galaxy samples 








b G 1.64 ±0.39 2.14 ±0.33 
M b 13.24 ±0.67 13.61 ±0.31 


4.30 ± 0.43 
14.41 ±0.20 


1.39 ± 0.18 
13.31 ± 0.44 


1.81 ±0.25 
13.72 ± 0.24 



Note. The masses are in units of log(M/M0) and we explicitly assumed h = 0.73 
and (78 = 0.9. For another choice of erg the bias can be easily rescaled as &g(°"8) = 
(0.9/crg)feQ(0.9). The error bars correspond to the standard deviation among the 24 mock 
catalogs. For the definition of the galaxy samples see Table 2. 
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of the 24 mock catalogs, which corresponds to about a 
3cr effect. This can likely be attributed to the extremely 
large structure that is present in the COSMOS field at 
z ~ 0.7. The small differences between the estimated 
masses for the magnitude- and volume-limited galaxy 
samples demonstrate the robustness of our result at low 
and at high redshift. 

In total we find overall fairly consistent results between 
the zCOSMOS sample and numerical simulations, al- 
though there are indications that the structures observed 
in the COSMOS field correspond to rare realizations of 
the COSMOS light cones. Our measured values of the 
bias are systematically larger than on average within the 
simulations, which reflects the unusual strong clustering 
in the COSMOS field that was reported in previous stud- 
ies (McCracken et al. 2007; Meneux et al. 2009; Kovac 
et al. 2010; de la Torre et al. 2010, 2011). 

This research was supported by the Swiss National 
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Very Large Telescope (VLT) under the Large Program 
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